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O ABSTRACT 

The complete 10-year survey from the Large Synoptic Survey Telescope (LSST) 
will image ~ 20,000 square degrees of sky in six filter bands every few nights, bringing 
the final survey depth to r ~ 27.5, with over 4 billion well measured galaxies. To take 
full advantage of this unprecedented statistical power, the systematic errors associ- 
ated with weak lensing measurements need to be controlled to a level similar to the 
00 statistical errors. 

This work is the first attempt to quantitatively estimate the absolute level and sta- 
tistical properties of the systematic errors on weak lensing shear measurements due to 
the most important physical effects in the LSST system via high fidelity ray-tracing 
\Q simulations. We identify and isolate the different sources of algorithm-independent, 

C^) additive systematic errors on shear measurements for LSST and predict their impact 

on the final cosmic shear measurements using conventional weak lensing analysis tech- 
niques. We find that the main source of the errors comes from an inability to adequately 
characterise the atmospheric point spread function (PSF) due to its high frequency 
• »"H spatial variation on angular scales smaller than ~ 10' in the single short exposures, 

/\ which propagates into a spurious shear correlation function at the 10 -4 -10 -3 level on 

J-j these scales. With the large multi-epoch dataset that will be acquired by LSST, the 

stochastic errors average out, bringing the final spurious shear correlation function 
to a level very close to the statistical errors. Our results imply that the cosmological 
constraints from LSST will not be severely limited by these algorithm-independent, 
additive systematic effects. 

Key words: cosmology: observations - gravitational lensing - atmospheric effects - 
surveys: LSST 



1 INTRODUCTION 

Weak gravitational lensing, or weak lensing for short, is one 
of the most powerful tools for probing dark matter and 



dark energy <|Albrecht et al. 20061. Distorted by interven- 



ing large-scale structures, the otherwise randomly oriented 



galaxy images encode signatures of dark matter and dark 
energy in a statistical way, namely through cosmic shear. 
For a review of weak lensing, see, for example, |Bartelmann| 



& Schneider ( 2001 hereafter BS01) . The lensing power spec- 



trum provides a unique tool to distinguish between different 



cosmological models (Jain & Scljak 1997 Kaiser 1998 Hu 
fc Tegmark|1999 |. 
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Since the first detections of the cosmic shear signal by 
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several independent groups (Wittman et al. 2000 Bacon 
et al.|2000 Kaiser et al.|2000 |, there has been an explosion 
of research activity in this field. The most recent analyses 
have shown that state-of-the-art weak lensing surveys are 
already probing interesting regions of the dark energy pa- 
rameter space ( Semboloni et al.|2006| Benjamin et al.|2007 



Hetterscheidt et al.||2007[ |Schrabback et al.||2010[|Lin eTaL] 



2011||Huff et al.|2011[ ). However, a major limitation of these 
existing surveys has been their relatively small sky coverage, 
which results in an insufficient number of galaxies to aver- 
age out their random shapes and orientations (i.e. to reduce 
the so-called "shape noise"). Cosmic shear measurements to 
date are limited by such statistical errors. 

As a result, several projects are attempting to over- 
come this fundamental limitation by significantly increasing 
the sky coverage. The Dark Energy SurveM M the Kilo De- 
gree Surve£J Hyper Suprime Carn^J LSS'ff||Tyson||2002 1 



and Euclicr] projects have all been explicitly designed for 
weak lensing investigations. The primary improvement of 
these projects over previous ones is that the cameras they 
incorporate have very large fields of view, which leads to a 
much larger survey area and a dramatic improvement in the 
statistical power of the dataset (Amara & Rcfregicr 20071. 
When statistical errors become negligibly small in these fu- 
ture surveys, systematics errors become a primary concern. 

For weak lensing, there are systematic errors associated 
with physical effects in the atmosphere and the telescope, 
and with imperfect algorithms used in the analysis. In this 
paper, we are mostly interested in quantitatively character- 
ising the former. Systematic errors due to imperfect algo- 
rithms are in principle reducible, and will certainly shrink 
as we gain experience with the near-term upcoming surveys. 
However, physical effects that are inherent to the system 
and independent of specific weak lensing algorithms are ir- 
reducible and most likely will determine the ultimate limits 
on cosmological constraints derived from weak lensing. 

In the past, the effects of different sources of system- 
atic errors on cosmic shear measurements have usually been 
calculated by assuming some hypothetical power spectrum 
for the spurious shear, often in simple functional forms for 



analytical calculations (Amara & Refregier 2008 hereafter 



AR08). However, these functional forms may not be well mo- 
tivated by physics. We make the first attempt to approach 
the problems in a bottom-up way and simulate the actual 
measurements to predict the level of systematic errors from 
first principles. We use LSST as our benchmark survey in 
this work, but many of the results are general or scalable to 
other future weak lensing surveys. The LSST Photon Simu- 
lator, or PhoSim ( jPeterson et al.|2009|[2012||ConnoHy et al.| 
20101 is used in this work to generate realistic LSST images 



for this study. In this way, we are able to measure quantita- 
tively the systematic errors generated from various physical 
effects in a controlled way. 

Note that in this paper we only discuss the case of ad- 
ditive shear systematics associated with the projected two- 
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point correlation function on a limited range of angular 
scales (within the field of a single focal plane). We do not 
consider weak lensing tomography (|Hu||I999|) or higher or- 
der statistics ( jSchneider &; Lombardi|2003| ISchneider et al. 



2005). The use of these other statistics can impose additional 



requirements on the level of systematic errors, but on the 
other hand, if the information is combined properly, it also 
has the potential of mitigating particular systematic errors 
that are only present in the projected two-point correlation 
function. 

The paper is organised as follows. A brief review of the 
canonical framework of weak lensing is given in Section [2] 
In Section [3] we present a short introduction to LSST and 
our simulation tool. In Section]!] we lay out a framework for 
classifying the different physical effects that induce errors in 
shape measurements. In Section [5] the different sources of 
errors and their correlation properties are quantified using 
simulations. Possible sources of spurious shear signals af- 
ter correcting for the PSF effects are discussed in Section [6] 
while the results from simulations are presented in Section 17] 
In Section [8] we discuss the prospect of combining multiple 
exposures, the implications for the determination of cosmo- 
logical constraints, and the effect of some of our assumptions 
on our results. We conclude in Section [9] 



2 WEAK LENSING NOTATION AND 
MEASUREMENTS 

In the presence of weak lensing, a galaxy image, having some 
intrinsic shape, is first sheared by the gravitational potential 
along the line-of-sight, then convolved with the atmospheric 
and instrumental PSF before being measured. As an ob- 
server, we want to reverse this process: measure the shape 
of a galaxy from a noisy image, correct for the PSF effects 
to infer the shear through an estimator, and finally calculate 
different statistics that are sensitive to cosmology using the 
shear estimator. For details on the weak lensing formalism, 
as well as predictions of weak lensing signals from different 
cosmological models, see BS01. 

For this work, following the steps in a data reduction 
process, we ask the following questions: (f ) How do the dif- 
ferent physical effects change the measured galaxy shape 
before any PSF correction has been made? (Section [4] Sec- 
tion [5f (2) To what level can these PSF effects be corrected 
to infer the correct shear using a conventional algorithm? 
(Section [6] Section [jj (3) With only the information from 
two-point shear correlation functions, how do the effects in 
(1) and (2) scale in the final combined dataset and what 
does that imply in terms of uncertainties in our predicted 
cosmological model? (Section [sj 



2.1 Weak lensing notation 

Throughout the paper we use the following definition for the 
complex "ellipticity spinor", e = Ei + 1E2, to parametrise the 
shapes of objects: 

In — I22 2I12 



£1 



hi + h 



£2 



hi + h 



(1) 



where hj are normalised moments of the object's light inten- 
sity profile f(xi, X2), weighted by a Gaussian filter W(xi, X2) 
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to reduce noise: 



3 LSST AND PHOSIM 



/ / dXldX2W(xi,X2)f(xi,X2)XiXj 
J J dxidX2W(xi,X2)f(x ll X2) 



hi = 1,2. (2) 



where the width of W(xi, X2) is chosen to give the maximum 
signal-to-noise ratio for each individual object. 

In this paper, if not otherwise specified, boldface sym- 
bols indicate the complex quantities and the magnitude of 
the complex quantity is specified using the corresponding 
regular-font symbol (e.g. s = \e\ = \J e\ + £%)■ A similar 
notation is used to parametrise shear 7, where we have 
7 = 7!+ 172. 

We also adopt the standard definitions for calculat- 
ing the correlation function xx (0) and power spectrum 
Cx(£): 

£+,xx(0) = {x t (e o )x t (e o +e)}+{x x (e o )x x (0o+0)) , (3) 



C x {£) = 2ir f 
Jo 



£4 



:(*) = 



2tt 



:(9)Jo(19) 



ducx(e)Meo) 



(4) 



(5) 



where X is a complex spinor (e.g. ellipticity e or shear 7) 
and the subscripts t, x indicate an isotropised decomposition 
of X along the line connecting a particular pair of galaxies. 
If X is measured in an arbitrary Cartesian coordinate sys- 
tem X = X\ + 1X2 with 1,2 denoting the two axes, then 
the rotated shear is calculated via X t = -Re{Xe~ 2iv ) and 
X x — —Im(Xe~ 2 '" p ), where ip is the argument of the vector 
connecting the pair of galaxies. The angle brackets (} indi- 
cate an average over all galaxy pairs separated by (with 
one galaxy located at some 80) ■ Jo is the zeroth-order Bessel 
function of the first kind. We will use £xx as shorthand for 
ti+,xx(9) for the rest of the paper. For the simulations in 
this work, we look at angular scales up to the scale of the 
full LSST focal plane (~ 3 degrees). 



2.2 Analysis tools 



In all of our analyses of the simulated images, we use the 
ickage 
object 



software package Source Extractor (Bertin & Arnouts 
19961 for object detection. We set the following con- 
figuration parameters: DETECT _MIN ARE A = 5 and 
DETECTTHRESH = 1.5. 

Background estimation, shape measurement, PSF cor- 
rection and shear estimation were done through the s oftware 
package IMC ATpjbased on the a lgorithm derived in 



et al. (19951, Luppino & Kaiser (19971 and Hoekstra et al 



Kaiser 



(19981, commonly known as KSB. The IMCAT parame- 



ters e[0], e[l], gamma[0] and gamma[l] correspond to the 
ellipticity and shear components ei, £2, 71 and 72 respec- 
tively, while the IMCAT parameter r g is used for the width 
of W(x\,X2) in Equation [5] Our specific implementation of 
KSB is similar to the "ES2" method in |Massey et al. | |2007 |. 
We describe briefly the KSB formulae in Appendix [B| 



The LSST survey will be the most powerful ground-based 
weak lensing survey planned for the coming decade. Its revo- 
lutionary scale will likely lead the next generation of optical 
survey designs. We therefore believe that using LSST as the 
target for this study will enable us to capture the most im- 
portant issues for future weak lensing surveys. 



3.1 LSST design parameters 

The optical design of LSST is optimised to cover as much 



sky as poss ible while maintaining good image quality (Ivezic 



et al. 



2008 1. The 8.4-meter aperture and the 9.5-degree field 
of view combine to an etendue of ~319.5 m 2 degree 2 , which 
is over 10 times larger than that of any previous survey 
facility. The heart of the instrument is a 64-cm-diameter, 
3.2-giga-pixel focal plane. The focal plane is tiled with 189 
CCD sensors, each with 4kx4k, 10 (jm square pixels (each 
pixel corresponds to an angular scale of 0.2"). The layout of 
the focal plane geometry is shown in Figure [T] 

Good image quality is one of the key components to 
weak lensing measurements. To ensure that over the entire 
survey period the instrumental effects that degrade the im- 
age quality are kept under control, LSST incorporates an 
Active Optics System (hereafter AOS), which adjusts the 
figures and positions of the three reflective optics and the 
orientation of the camera to correct the wavefront errors. 

LSST will take images approximately every 15 - 20 sec- 
onds, covering the entire available hemisphere every few days 
in six optical filter bands. Each of the 2 consecutive 15- 
second exposures (separated by 4-second readout and shut- 
ter open/close) is called a visit. The 2 exposures in a visit 
will be taken on the same field. From visit to visit, the tele- 
scopes will then point to different fields in order to achieve 
the very wide sky coverage. The 10-year survey will generate 
an unprecedentedly large amount of data (nearly two thou- 
sand 15-second exposures on each field across the 20,000 
degree 2 sky). For cosmic shear measurements, this means 
reducing the statistical errors from shape noise and cosmic 
variance by orders of magnitude. As a result, understanding 
the sources of systematic errors in these data will undoubt- 
edly be a major challenge for LSST. 



3.2 LSST observation parameters 

We extract from catalogues generated by the LSST Opera- 
tions Simulator ( Krabbendam et al.|2010 hereafter OpSim) 
information about the the observing conditions in an ex- 
pected LSST weak lensing dataset. OpSim simulates the at- 
mosphere and night sky conditions for individual exposures 
at the LSST site over 10 years based on weather models, 
telescope models and optimisation of the survey strategy. 

From previous work, it is known that only images with 
the best image quality contribute to the cosmic shear signal 
(Hoekstra et al. 20061. As a result, to estimate more ac- 
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curately the "typical" observing parameters for images that 
contribute to the final cosmic shear measurement for LSST, 
we take mediums of the major observation parameters in a 
subset of the full OpSim catalogue. This subset consists of 
the 50% of the r-band (552-691 nm) data that give the best 
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Table 1. Fiducial observing parameters. All parameters are spec- 
ified for the LSST r band (552-691 nm). These observing parame- 
ters are chosen to be typical for obtaining images for weak lensing 
measurements. The numbers are calculated from the medium val- 
ues of the best 50% r-band exposures. Note that a a t m includes 
the airmass contribution. 



3.5 degree Field of View 
(634 mm diameter) 



CCS +Z 



Parameter name 


Description 


Fiducial value 


latm 


atmospheric seeing 


0.56" 


lopt 


instrumental 
PSF FWHM 


0.42" 




airmass 


1.2 


^sky 


sky background 


640 counts/pixel 


^cxp 


exposure time 


15 seconds 


b 


Galactic latitude 


-60° 


N 


number of exposures 


184 



Figure 1. Layout of the LSST focal plane taken from I vezic et al.| 
l |2008[ l. The red and blue thin lines indicate the boundaries for 
each sensor, where the blue area is the 189 science sensors we 
use in our focal-plane size simulations. The ellipticity maps from 
our simulations shown in Figures[3]and[5]correspond to ellipticity 
values in these areas. 



image quality. From this process we define the "fiducial ob- 
serving parameters" for a 10-year LSST weak lensing dataset 
as listed in Table [l] Note that the parameters in Table [T] are 
specified (when applicable) for r band, while for real weak 
lensing analyses, i-band (691-818 nm) images are likely to 
be used as well. We carry out all the analyses presented 
here in r band, but extrapolate the results to i band (see 
Section [8. knowing that the image quality and observing 
parameters are similar in both bands. In Table [l] the pa- 
ra meter a opt is the de signed instrumental PSF size specified 



Ivezic et al. ( 201 l| p^ at the elevation that corresponds to 



the median airmass. 



3.3 PHOSIM 



To study in detail the expected systematic errors in weak 
lensing measurements for this survey, existing data from 
other projects are insufficient in both the data quality and 
quantity. As a result, simulations become the only way to in- 
vestigate such problems before the telescope is built. For this 
study in particular, simulations also enable us to trace the 
individual sources of systematic errors in a controlled and 
bottom-up fashion, which is almost impossible to achieve 
with real data. A few unique features of the simulation pro- 
cess in this work should be emphasised: First, all the physical 
effects that introduce systematic errors in the shape mea- 
surements can be separately turned "on" and "off". There- 
fore, we have full control over which actual physical effects 
are dominant in determining the image shape error. Second, 
the PSF at the location of a galaxy image can be known 
exactly by simulating an image with a bright source at that 



same location. Finally, all physical processes in these numer- 
ical experiments are reproducible. 

In an earlier attempt to simulate LSST as a complete 
system using a modified version of existing optics software 
( |Jee fc Tys on 2011), the potential power of studying these 
issues via simulations has been demonstrated. In this work, 
we take the analysis one step further and invoke P ho Sim 
as our primary tool for generating simulated images. Un- 
like the software used in |Jee fc Tyson| ( |2011| |, PhoSim is a 
set of custom-made software designed specifically to repre- 
sent the LSST's performance, and simultaneously incorpo- 
rate many aspects of the project design (e.g. data manage- 
ment software development and scientific studies). PhoSim 
adopts a photon-by-photon Monte Carlo fast ray-tracing al- 
gorithm, which generates images expected for LSST with 
very high fidelity. In collaboration with the LSST instru- 
mentation teams and multiple science groups, the PhoSim 
software has been continuously updated and cross-checked 
to track the most current hardware developments. 

PhoSim is part of the end-to-end LSST Image Simula- 
toi[^] (ImSim). ImSim simulates the forward process from cos- 
mological models to realistic astronomical images, in which 
PhoSim is responsible for the last part in this process - the 
photon propagation from top of the atmosphere down to 
the CCD sensors and the signal readout. ImSim begins with 
a catalogue of celestial sources based on large cosmological 
N-body simulations and detailed Milky Way and Solar Sys- 
tem models. A realistic observing environment is then set 
up by using parameters predicted by the OpSim catalogue. 
PhoSim then simulates the final "exposure" by tracing indi- 
vidual photons from objects in the catalogue for that part 
of the sky, through the atmosphere, the telescope, and into 
the camera to form an image that retains all the major char- 
acteristics we anticipate in the LSST data. 

We use PhoSim version 3.0 to do all the analyses in 
this paper. To provide sufficient background material to in- 
terpret the results, we describe briefly in Appendix [A] the 
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major physical models in PhoSim, and refer to Peterson 
et al. (20121 for further details. 



4 SOURCES OF ELLIPTICITY ERRORS 

Assume the PSF has some finite size a and we measure a 
PSF-convolved galaxy to have ellipticity e m , then this e m 
can be broken down to an intrinsic component, a shear com- 
ponent and an additional component from various physical 
effects associated with counting statistics, the atmosphere 
and the telescope/camera system. If we assume that all 
these three components are evaluated for the same measured 
galaxy size and we are only interested in the small changes 
in the anisotropy of the galaxy shap^] we can write out the 
following relation in linear additive terms: 



e H — 7 + e 

a 



(6) 



where the first term is the ellipticity of the galaxy convolved 
with a circular PSF, the second term is the change in e m 
due to shear and the last term is the change in e m due 
to other physical effects, a is a scaling factor to first "de- 
weight" the ellipticity calculated from the weighted moments 
(Equation[2j and then account for the effect of the finite-size 
PSF. The factor of 2 in the second term converts shear into 
ellipticity (BS01). For infinite resolution (PSF ~ delta func- 
tion) and ellipticities calculated from unweighted moments 
(W = 1), we have a = 1. For infinite resolution and ellip- 
ticities calculated from weighted moments, a is equivalent 

defined in 



to two times the shear responsivity PJ^ 



et al. ( 1998 1 



Hoekstra 



The correlation function for the measured ellipticity can 
thus be written out as: 



fa 



, £77 + £e 



+2(-e s « 7 + ~e y «. +c £ 

a a 



(7) 



4.1 



Non-stochastic and stochastic errors in 
ellipticity measurements 



Here we present a concept for classifying e s similar to that 
in |Jain et ah] ( |2006[ ) . This classification scheme is especially 
important for analyses of multi-epoch datasets such as LSST 
- this is the first step towards understanding the nature of 
different sources of systematic errors in shear measurements. 
Two major classes of physical effects combine to give e" . We 
use the terms "non-stochastic" and "stochastic" to refer to 
these two classes of errors. 

Non-stochastic errors are those that are either fixed in 
space and time, or vary with characteristic patterns over 
multiple exposures. Stochastic effects, on the other hand, 
induce errors that change randomly from exposure to expo- 
sure with no correlation in time. For non-stochastic errors, 
because they show repeated patterns from frame to frame, 



Table 2. Classification of the major physical effects that intro- 
duce errors in shape measurements. The difference between non- 
stochastic and stochastic optics errors is described in Section]?] 



Non-stochastic effects 


Stochastic effects 


Optics design 


Counting statistics 


Non-stochastic optics errors 


Stochastic optics errors 




Tracking errors 




Atmospheric effect^^j 



one does not benefit from averaging over multiple indepen- 
dent exposures; however, this repeating feature also means 
that they potentially can be characterised very well when 
one properly combines the multi-epoch dataset. Stochastic 
errors are exactly the opposite: randomness implies one can 
only model them with data from limited information in a 
single exposure, but via some form of averaging of the mul- 
tiple exposures on the same field, the errors are likely to 
cancel each other. 

In Table [2j we identify the major non-stochastic and 
stochastic effects that are modeled in PhoSim and are most 
likely to contribute to s s . We also provide in Appendix [A| 
brief descriptions of how each of these effects is modelled 
in PhoSim. There are some physical effects that may be 
present and are not yet modeled in the current PhoSim, but 
we believe they will not contribute significant. EThe effects 
listed in Table [2] should comprise the great majority of the 
sources of error for weak lensing measurements with LSST. 

In most existing weak lensing data, non-stochastic ef- 
fects dominate the error; therefore the origins of these sys- 
tematic errors are historically better understood. For exam- 
ple, [jarvjs^tjiL] ([2008} were able to model the PSF patterns 
of telescope aberrations with low order functional forms. 
Stochastic effects, being relatively small in existing data, 
have not been studied in detail. Only a few pioneering stud- 
ies have tried to understand the stochastic effects under sim- 



ple cases: Paulin-Henriksson et al. (20081 and Zhang (20101 



studied the noise contribution to shape measurement due 
to counting statistics and pixelation, while |De V rics et al. 
(20071 investigated the atmosphere-induced ellipticity and 



its time dependence. 

Note that this classification scheme is only valid under 
a well-defined survey since it depends on the cadence of the 
survey and other operational issues. In this paper we are 
assuming the as designed LSST survey mode (Section 3.1 1, 



where the minimum time between consecutive visits on the 
same patch of sky is approximately 30 minutes ( jlvezic et al.| 
|2011[ ). This means that the telescope has experienced at least 
-~50 different pointings between the two consecutive visits 
and is looking through a very different column of atmosphere 
each time. Under this scenario, most of the physical effects 
we discussed are truly stochastic, or at least stochastic to a 
very high level between visits. For the two exposures in the 
same visit, due to the close separation in time, stochasticity 
is not guaranteed for all effects - we discuss in Appendix [D] 



9 Because the measured ellipticity of a galaxy after convolution 
with a PSF depends nonlinearly on the width of the PSF, we 
make this assumption and only investigate the anisotropic part 
of the ellipticity change, which can be viewed as linear. 



10 Atmospheric effects here does not include effects that change 
the measured size of the galaxy such as variation in the seeing 
and airmass. Instead we use the median seeing and airmass as 
listed in Table [T| through the paper. 
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— original galaxy 

— circularized galaxy 
■-■ R=0. 4" Gaussian 

R=0. 65" Gaussian 
- -r B=0. 8" Gaussian 
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10 

9(arcmin) 



100 



Figure 2. Ellipticity correlation function for different samples of 
simulated galaxies. Black shows the ellipticity correlation func- 
tion for a realistic galaxy sample, while red shows the same func- 
tion measured from the circularised counterpart of the realistic 
galaxy sample. Green curves are measured from samples of circu- 
lar Gaussian shapes of different sizes. The green curve measured 
from circular Gaussians of 23rd r-band magnitude and i?=0.65" 
(green square) agree approximately in level and shape with the 
black and red curves, where R is the F WHM size of the Gaussian 
profile. This demonstrates (1) shape noise is uncorrelated and (2) 
we can use the circular Gaussian shape with r-band magnitude 
23 and R =0.65" as the "fiducial galaxy" to measure the spatial 
correlations of the response of the entire galaxy population. 



how this factor may be estimated for the spurious shear 
correlation function in the combined dataset. 



4.2 Practical considerations 

Real galaxies have intrinsic shapes, and will be subject to 
cosmic shear, so that in Equation Ifjl s* and 7 are not gen- 
erally equal to zero. To average over these effects at the 
statistical level sampled by LSST, we would need to sim- 
ulate ~200 images of roughly four billion galaxies in each 
test. This is computationally impractical, so we adopted a 
simpler approach described below. 

Note from Equations [6] and [7] that if we set up simu- 
lations so that e l = 7 = 0, we can avoid the contribution 
from e x and 7 in the observable s m , and directly measure e 3 
and £ e s e s unambiguously. This suggests that our problem is 
equivalent to asking the following question: 

Under zero shear, what is the anisotropic component 
of the spurious ellipticity e s induced by a certain physical 
effect on a circular object of e % — and what are the 
correlation properties £ £ s £ s of those spurious ellipticities? 

That is, we do not measure the ellipticity on a fully re- 
alistic galaxy population with a distribution of shapes, sizes 
and brightnesses; instead, simple circular "galaxies" are used 
as "test particles" for the entire population of galaxies. We 
show below that this approach is justified for our purposes. 

In Figure [2] we simulate a representative galaxy sample 
from the PhoSim sky catalogue and measure the elliptic- 
ity correlation function £ e m e m from a typical single expo- 
sure. We then "circularise" these galaxy images at the in- 



Table 3. Fiducial galaxy characteristics specified in r band. The 
representative sample of weak lensing galaxies can be collapsed 
into the fiducial galaxy and reproduce the same ellipticity corre- 
lation function - on average, the fiducial galaxy reacts to the PSF 
effects the same way as the population of galaxies. 



Parameter name 


Description 


Fiducial value 


m 


AB magnitude 


23 


S 


total signal counts 


~ 2600 counts 


R 


Gaussian FWHM 


0.65" 


"gal 


number density 


~ 5.5 /arcmin 2 


SNR 


signal-to-noise ratio 


8.33 



put catalogue level before entering the atmosphere so that 
they retain all the characteristics, such as size, brightness, 
redshift and spectral energy distribution (SED) in the orig- 
inal sample, but lose the shape information. Although the 
original sample shows a noisier ellipticity correlation func- 
tion, the circularised sample roughly agrees with it in both 
level and shape. The agreement between the ellipticity cor- 
relation functions measured from the original galaxies and 
the circularised galaxies demonstrates that shape noise is 
not spatially correlated; thus it should play no role in the 
correlation function as expected. In other words, we have 
£ e i £ ; = £ 77 = £ e «„ = £ e s 7 = for both samples. The slightly 
lower red curve is mainly due to the small £ e i e s term that is 
present only in the original galaxy sample. We show that we 
can isolate £ e s e s in the £ e m £m using the circularised galaxy 
sample. 

To further simplify the problem, the distribution of cir- 
cular galaxies is collapsed into a single circular Gaussian 
shape. By exploring the size-magnitude parameter space, 
we find that using roughly the average magnitude, size and 
number density of the original sample, we can recover the 
ellipticity correlation of the circularised galaxy sample. For 
the rest of this paper, we will refer to this special circular 
Gaussian as the "fiducial galaxy." Figure[2]shows the elliptic- 
ity correlation functions for three different sizes of circular 
Gaussian shapes, with the middle one (square) being the 
fiducial galaxy. The characteristics of the fiducial galaxy are 
listed in Tabled 

The construction of the fiducial galaxy is an approxima- 
tion, but is appropriate for our analyses with the following 
caveats. First, by taking the ellipticity results from a circular 
galaxy (e ! = 7 = 0) as a general result for the whole galaxy 
population, we are assuming that the average ellipticity error 
on the population of galaxies is approximately the ellipticity 
error on the average galaxy in the population. This is ignor- 
ing the fact that certain algorithms may tend to measure 
the ellipticity of a galaxy more accurately when the galaxy 
is more circular or more elliptical. This intrinsic-ellipticity- 
dependent error may introduce additional errors in the ellip- 
ticity measurements. We ignore them because these errors 
are algorithm-dependent, and are spatially uncorrelated, i.e. 
they only contribute a small addition contribution to shape 
noise. Second, by choosing a Gaussian profile for the fidu- 
cial galaxy rather than a more realistic Sersic-type profile, 
we are assuming our ellipticity measurement method per- 
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forms equally well on Gaussian profiles and realistic galaxy 
profiles. This again is to eliminate the algorithm-dependence 
coupling to the problem and also important later for shear 
measurements as discussed in Section [6] 



5 QUANTIFYING ERRORS ON ELLIPTICITY 
MEASUREMENTS 

In this section, if not otherwise specified, the measured el- 
lipticity on any simulated galaxy image is effectively an "el- 
lipticity error" generated from a cer tain physical effect, for 

0). Thus 



4.2 



reasons we have explained in Section 

we omit the superscripts in our notation and use s (£ e 
instead of e s (fc s e s ) or e m (£ £ ™ e ™). 

Also, for all ellipticity measurements, we define the 
quantity a[e] to be a measure of the uncertainty in ellip- 
ticity measurements due to a certain physical effect. a[e] 
is denned as the square-root of the quadrature sum of the 
standard deviation of individual ei and E2 distributions (as 
opposed to the standard deviation of e): 



(8) 



5.1 Non-stochastic effects 

5.1.1 Simulations 

To examine the non-stochastic effects, we generated two sets 
of simulations. Each set of simulations consists of one or 
more full LSST focal planes. In all of the simulations, since 
we need to "turn off" the atmospheric effects, we convolve 
the fiducial galaxies with a circular Gaussian before running 
the simulations to ensure that the observed galaxies have 
the same size as a fiducial galaxy observed under the fidu- 
cial observing conditions. To suppress the contribution from 
counting statistics errors in the measurements, all objects 
are generated with high SNR at ~ 160. 

In the first set of simulations, we simulate the "as 
designed instrument" by including only the optics design, 
isotropic charge diffusion in the CCD detectors, and pix- 
elisation. In the second set of simulations, we include non- 
stochastic perturbations to the optics due to solid body mis- 
alignments, surface perturbations of the major optics ele- 
ments, and warping and misalignment of the individual sen- 



sors in the focal plane (see Table Al for approximate levels 
of the major perturbations). Twenty focal-plane size images 
with different Gaussian random realisations of these errors 
are generated in order to capture the effects in a "typical" 
LSST observations. 



5.1.2 Results 

In Figure [5] (a) , we show the ellipticity magnitude e mea- 
sured from the simulations across the LSST focal plane for 
the as designed instrument. When non-stochastic optics er- 
ror is induced, the measured ellipticity changes. One exam- 
ple in the set of simulation is shown in Figure [3] (b), where 
the change in ellipticity due to a certain set of optics errors 
is shown. 

The distribution of s for the two sets are plotted in 



Figure[4](a) with arbitrary normalisations. The correspond- 
ing o~\e] values of these distributions after correcting for 
the counting statistic^""] are listed on the plot. We mea- 
sure a[e] ~7x 1CT 3 for the design, a[e] ~ 6 x 1(F 3 for the 
non-stochastic optics effects. The total ellipticity contribu- 
tion from all non-stochastic effects on the fiducial galaxy is 
a[e] ~ 9 x 1(T 3 . 

The median absolute ellipticity correlation function 
measured from the fiducial galaxies in all the simu- 
lations is plotted in Figure [4] (b) for the design and for the 
non-stochastic optics effects added. The total ellipticity cor- 
relation function for all effects in this "non-stochastic" class 
is also shown. The correlation function for the design is at 
the level ~ 1.5 x 10~ 5 with a rather flat shape. Adding non- 
stochastic effects almost doubles the level of the correlation 
function. 

Note that Figure [4] may not be characteristic of other 
telescopes, since we have utilised a large amount of LSST- 
specific information about the optics configuration and en- 
gineering tolerances ( Ivezic et al.|20lT |. However, the main 
message from this section is the demonstration that for fu- 
ture large telescopes with designs similar to LSST, the non- 
stochastic spurious ellipticity correlation will be at a low 



level compared with existing telescopes (see Jarvis & Jain 
20041 for example). 



5.2 Stochastic effects 

5.2.1 Simulations 

To examine the contributions of stochastic effects, we use 
one realisation of the non-stochastic optics errors in the 
previous simulations and then add on stochastic contribu- 
tions that vary randomly from exposure to exposure. Non- 
stochastic contributions to the ellipticities are later sub- 
tracted component-wise from the measured ellipticities to 
obtain the stochastic contribution. 

For each of the four stochastic effects listed in Table [2j 
we generate a set of 20 focal-plane-size simulations with fidu- 
cial galaxies distributed over the field. The input parameters 
to the 20 simulations in each set are controlled so that each 
of the other three effects are "turned off" and only one ef- 
fect is "turned on" - only parameters associated with that 
one effect are allowed to vary. In addition, we generate one 
set of simulations (20 focal-plane-size image), where the four 
stochastic effects are all turned on. These images are used 
in Section[7|for shear measurement tests. We describe below 
the prescriptions for how we set the parameters for each of 
the effects in the simulations. 

Counting statistics is the largest stochastic source of 
noise in these single exposures. It is also the only effect that 
is stochastic in both space and time, which prohibits it from 
being corrected through PSF modelling. 

The relevant measure for counting statistics is the SNR 

11 We will see later in Section |5.3| that for the ellipticity mea- 
surements done with SNfv~160 objects, there exist uncertainties 
from counting statistics at the a[e] ~ 5 x 10 — 3 level (see Equa- 
tion [l2j, which we need to subtract in quadrature from the raw 
measurements to isolate the uncertainties due to the specific effect 
of interest. 



8 C. Chang et al. 




0.020 
0.018 
0.016 
0.014 
0.012 
0.010 
0.008 
0.006 
0.004 
0.002 
0.000 




(a) 



(b) 



Figure 3. Ellipticity magnitude e measured from the fiducial galaxies over the LSST focal plane for the (a) optics design and (b) one 
example of adding non-stochastic optics errors. All non-stochastic optics effects induce ellipticity magnitudes <0.02 for most of the field. 
Greater ellipticites are mainly near the edges. The large-scale variations come from perturbations of the positions and surface heights of 
the mirrors and lenses, while the variation in the focal plane height contributes to the visible boundaries between individual CCDs. 




Figure 4. The following abbreviations are used for the different sources of ellipticity errors: DES (design), OPT_NS (non-stochastic 
optics effects) and ALL_NS (all non-stochastic effects), (a) Distribution of the ellipticity magnitude measured for the fiducial galaxies 
when different non-stochastic effects are added, (b) Absolute correlation function of the ellipticity errors for the fiducial galaxies when 
different non-stochastic effects are added. The red curve is the median value for 20 different realisations of the non-stochastic optics 
effects. 



of an object, which we can calculate straightforwardly for 
a circular Gaussian profile given the total signal counts S, 
object FWHM size R, background counts -B s k y and apparent 
object FWHM size R m : 



SNR: 



0.7 x S 



y/0.7 x S + 7r(1.34 x R m ) 2 x B sky 



(9) 



Here we use a typical aperture radius of ~1.34 times the 
FWHM of the apparent object size R m , containing ~70% 



of the source counts. For circular Gaussian, the apparent 
object size R m can be approximated as the object size R 
convolved with a circular Gaussian with FWHM size equal 
to the PSF size a: 



(R m ) 2 =R 2 +a 2 



(10) 



where a can be estimated by adding to a atm in quadrature 
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the instrumental PSF contribution a op 



opt 



(11) 



In this section, we fix S, R, B Eky and a to the fiducial 
values in Tables [T] and [3] In Section |5,3| we explore the 
full SNR parameter space. The 20 focal-plane-size images 
in this set are identical except that different photons are 
drawn from the galaxy and therefore travel slightly different 
light-paths. 

Stochastic optics effects are the residual optics errors 
after AOS correction that do not show repeatable patterns 
from exposure to exposure. The 20 focal-plane-size images 
in this set are identical except that the optics perturbations 
are varied from frame to frame within the level allowed by 
the adopted tolerances listed in Table [AT] 



Tracking errors occur due to imperfect tracking of the 
telescope during the exposure. They cause the measured ob- 
ject shape to be slightly elongated in the direction of the sky 
rotation. The 20 focal-plane-size images are identical except 
that a different tracking error trajectory within the adopted 
tolerances described in Section |A2| is assigned to each reali- 
sation. 

Atmospheric effects are slightly more complicated to 
model. For the same assumed fiducial seeing in all 20 simula- 
tions, different realisations of the atmosphere are generated 
by different combinations of the structure function, outer 
scale, wind speed and wind directions over the multiple at- 
mospheric layers. The 20 focal-plane-size images are identi- 
cal except that a different combination of these parameters 
is used. 

For the first three sets of simulations, since the atmo- 
sphere is turned off, all objects are convolved with a circu- 
lar Gaussian before we run the simulator so that the mea- 
sured object size is the same as if it had propagated through 
the atmosphere. Similar to Section [5.1.1[ for all sets except 
the first, objects are generated with high SNR at ~ 160 
to suppress contribution from counting statistics errors in 
the measurements. The ellipticity of each object is mea- 
sured and the mean ellipticity for each object over the 20 
realisations is taken as the non-stochastic contributions and 
subtracted component-wise to yield the stochastic ellipticity 
component. 



5.2.2 Results 

Figure [5] shows one example of the absolute ellipticity er- 
rors in each of the four sets. The colour mapping for the 
four plots are adjusted to best illustrate the spatial patterns 
and absolute ellipticity levels. Note that in Figure [5] (a) the 
CCDs in the corner of the field are missing. This is because 
the fiducial galaxies have very low SNR at those vignetted 
locations. In a more realistic field, brighter galaxies will still 
be detected there. 

Distributions of the magnitudes of the stochastic ellip- 
ticity errors e measured from the four sets of simulations 
are plotted in Figure [6] (a). Each of the curve is normalized 



so that it peaks at . Also overlaid is the total stochastic el- 
lipticity error distribution. The corresponding a[e] values of 
these distributions after correcting for counting statistics 11 
are listed on the plot. Figure[6](c) is a zoomed-in view of Fig- 
ure [6] (a) on the lower ellipticity values. Clearly, in a single 
exposure the dominant error contribution to the shape mea- 
surements for a fiducial galaxy is counting statistics, giving 
a[e] ~ 1.1 x 10 _1 . The atmospheric effects and the stochas- 
tic optics effects are at similar levels and are the second 
and third largest contributors, giving a [el ~ 1.2 x 10~ 2 and 
a[e] ~ 1.1 x 10~ 2 respectively. Tracking errors are the most 
insignificant effect of the four, with a[e] ~ 5 x 10~ 3 . The 
total stochastic ellipticity uncertainty when all effects are 
turned on is a[e] ~ 1.3 x 10 _1 . Note that the total non- 
stochastic ellipticity error discussed in Section |5.1| is more 
than an order of magnitude smaller than the total stochastic 
ellipticity errors. 

We now turn to the median correlation functions of 
these four stochastic ellipticity error components, shown in 
Figure [6] (b), along with the total stochastic ellipticity error 
correlation function. The error bars in each case show the 
standard deviation of the 20 realisations divided by \/20. 
The first observation is that although counting statistics er- 
rors dominate in the ellipticity error as shown in Figures [5] 
and [6] (a), they are completely uncorrelated. They oscillate 
rapidly but are always consistent with zero. This is of course 
expected - regardless of the SNR of the measured objects, 
counting statistics makes no contribution to the correlation 
function. Tracking errors, on the contrary, being the lowest 
in Figure [6] (a), contribute to a small but non-zero correla- 
tion at the 10 level. The stochastic optics effects generate 
ellipticity correlations slightly below the 10 -4 level while the 
atmospheric effects contribute to a similar level of ellipticity 
correlation with a steeper shape as can be seen more clearly 
in Figure|6](d), the zoomed-in view for Figure [6](b). The to- 
tal ellipticity correlation function is thus dominated by the 
atmosphere component at small scales and then a combi- 
nation of the stochastic optics errors and the atmospheric 
effects at larger scales. Also notice that the non-stochastic 
component is approximately an order of magnitude smaller 
than the total stochastic ellipticity correlation in a single 
exposure. 



5.3 Discussion 

Up to this point, we have quantified cr[e], the expected levels 
of errors in ellipticity measurements due to different physi- 
cal effects for a typical LSST single exposure. These results 
can be scaled to other observing conditions and source dis- 
tributions as a first-order estimation for the uncertainties in 
ellipticity measurements in another dataset. The two major 
quantities that govern the scaling of a[s] and £ ee for a cer- 
tain dataset are the average observed object size R m (Equa- 
10 1 and the average SNR (Equation of the objects. 



tion 



The level of counting statistics contribution to ellipticity er- 
rors a[e] is expected to scale with some function of SNR, 
while ellipticity errors from all the other effects scale witrf 12 ] 



12 This can be derived by assuming the measured ellipticity 
comes from an elliptical Gaussian PSF convolved with a cir- 



cular Gaussian galaxy. If the PSF has second moments V- 
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(c) (d) 

Figure 5. Ellipticity magnitude e measured from the fiducial galaxies over the LSST focal plane for four stochastic effects: (a) counting 
statistics, (b) tracking errors, (c) stochastic optics errors and (d) atmospheric distortions. In each case, the colour bars are adjusted to 
best show the ellipticity spatial pattern. Since these effects are stochastic, we show only one representative realisation of the random 
process to illustrate the kind of ellipticity pattern induced by each effect. Among the four, (a) counting statistics induces the highest level 
of errors and show "missing" sensors on the edge of the field due to the fact that vignetting causes the fiducial galaxies to be undetectable 
at those positions. 



(R m ) 2 . For £ eE , on the other hand, the counting statistics 
contribution is essentially zero, while all other components 
scale with (i? m )~ 4 . 

Note also that in general the measured level of the cor- 



and the galaxy has second moments Iij, then because the mo- 
ments are effectively summed in the convolved image, and In — 
I22 = I12 = 0, the convolved object has ellipticity (£1,^2) = 
I 11+ /( l+ j 22+ j^ 2 Wi ~ / 22' 2/ 12) K (ijW) 2 (-fii - I 2 2' 2I 12)- 



relation function is lower than what is expected for a naive 
assumption of |£ es | ~ f[e] 2 - This is because the distortions 
of the galaxies are usually only partially correlated in space. 
The degree of correlation, which is governed by the physi- 
cal mechanism that induces the correlation, determines how 
close |£ Ee | approaches cr[e] 2 . 

To determine the scaling of a[s] with SNR, we perform 
a series of simulations similar to the first set of simulations 
{i.e. counting statistics) in Section [5. 2. 1| but vary the input 
galaxy's size and magnitudes over the range R = [0" (point 
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Figure 6. The following abbreviations are used for the different sources of ellipticity errors: CS (counting statistics), OPT_S (stochastic 
optics effects), TRK (tracking errors), ATM (atmospheric effects), and ALL_S (all stochastic effects), (a) Distribution of the ellipticity 
magnitude measured for the fiducial galaxies when different stochastic effects are added. The grey shaded area indicates the ellipticity 
magnitude distribution from all the stochastic effects together, (b) Absolute correlation function of the ellipticity errors for the fiducial 
galaxies when different stochastic effects are added. Note that all curves plotted for the stochastic effects are the median value for 20 
different realisations, with the error bars showing the standard deviation of the 20 realisations divided by \/20. Negative values are 
plotted with open symbols, (c) is a zoomed-in view of (a) on the lower ellipticity values and (d) is a zoomed-in view of (b) on the higher 
correlation function curves. 



source), 0.5", 0.7", 0.85", 1.0", 1.5", 2.0"] and m = [18, 19, 
20, 21, 22, 23, 24] to cover a nominal galaxy population and 
plot a[s] as a function of the object's SNR. The results are 
illustrated in Figure [7] We find that even for the wide range 
of size and brightness sampled, the ellipticity errors for all 
objects lie on a power law curve of index ~ — 1, described 
by the fit: 

0.875 (12) 



0.875 x SNR" 



SNR 



where the subscript CS indicates the ellipticity uncertainty 
due to counting statistics errors only. Equation [12] is consis- 
tent with the analytical predictions and numerical simula- 



tions ( Paulin-Henriksson et al. 2008 Refregier et al. 20121 



in previous studies. 

In Figure [T| we show the breakdown of a[e] into differ- 
ent components for a galaxy of FWHM size ~ 0.65" under 



fiducial LSST observing conditions (Table [I]) as a function 
of the SNRs of the objects. The top axis m* shows the cor- 
responding r-band AB magnitude for objects at that SNR. 
The errors due to all non-stochastic effects and the errors 
due to all stochastic effects except counting statistics are by 
definition independent of the SNR of the galaxy, therefore 
they are represented by horizontal lines on the plot. The to- 
tal errors from stochastic effects are derived by adding the 
level of counting statistics contributions and other stochastic 
errors in quadrature. 

Under the assumption that these individual noise terms 
are approximately independent from one another, we can 
now estimate the uncertainty in ellipticity measurements of 
an arbitrary galaxy under an arbitrary scenario by scaling 
the results from our tests with the fiducial galaxies and con- 
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ditions, which we denote by the subscript "0" in the following 
steps: 

(i) Calculate the galaxy's SNR, which depends on R m , S, 
a and _B s k y , and scale the counting statistics errors on the 
ellipticity from the fiducial case via: 

SNRo 



a[e\ oc 



SNR 



(13) 



(ii) Calculate the contributions from all other effects by 
scaling the results in Sections |5.1| and |5.2| by the measured 
galaxy size: 



a[e] oc 



R m 



(14) 



(iii) All stochastic components are further scaled by the 
exposure time: 



'exp,0 
^exv 



(15) 



(iv) Add the individual components in quadrature to 
yield an estimate of the total ellipticity uncertainty in the 
measurement. Note, however, that the simple assumption 
that all effects are decoupled breaks down at the low SNR 
end, where the errors are no longer small and cannot be 
linearly decomposed into the different components. 

It is straightforward to also estimate the ellipticity error 
correlation function of a population of galaxies: 

(i) Since counting statistics errors do not correlate, we do 
not need to account for a correlation function for them. 

(ii) Scale the individual correlation functions in Figures [4] 
and [6] by the average R m of objects in the frame: 

4 



Zee OC 



■ftp 
R m 



(16) 



(iii) Scale the stochastic components of the individual 
correlation functions by t exp : 



Zee oc 



^exp,0 
texv 



(17) 



(iv) Add the individual components to yield the total el- 
lipticity correlation function. 

These scaling relations serve as first-order estimates of 
the ellipticity errors and error correlations. 



6 SOURCES OF SPURIOUS SHEAR 

After measuring the ellipticities of the galaxies (e m in Equa- 
tion [fjj, the next step, following Equation 18 is to estimate 
the PSF-induced ellipticity errors e s , estimate the scaling 
factor a and calculate the shear estimator ■y: 



7 : 



1 / 71 



7 - 



(18) 



The first part of Equation [18] represents operationally how 
one would calculate 7 and the second part comes from rear- 
ranging Equation [6] When averaging 7 over a large number 
of galaxies, we recover the true shear, or (7) = 7. As de- 
scribed in Section |4.2| since we have set both shear and 
intrinsic ellipticity to zero in our simulations, any non-zero 
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Figure 7. Uncertainties of ellipticity measurements induced by 
counting statistics errors for objects in a large range of size and 
magnitudes is labeled by the black triangles, which is well fit by a 
function of the objects' SNR as shown with the grey dashed line. If 
all the parameters other than magnitude (m, or effectively, 5) are 
held fixed, then the bottom SNR axis corresponds to the r- band 
AB magnitudes labeled on the top axis m*. The yellow diamond 
at m* = 23 is where the fiducial galaxy is located. Under the same 
assumption that we only vary the object's magnitude along the 
x-axis, we can then plot the ellipticity uncertainties induced by 
all the other physical effects discussed in this paper as horizontal 
lines since these errors depends only on the object's size. Finally, 
the level of intrinsic shape noise is indicated by the solid blue line. 



shear measurement from Equation |18| even without averag- 
ing over an ensemble of galaxies, indicates mis-estimation 
of e s and/or a. In other words, any measured shear from 
our simulations is spurious. In the remainder of this section, 
we use 7 s and £ 7 s 7 = to indicate the spurious shear and their 
correlation functions. As opposed to previous ellipticity mea- 
surements, we only show results for £ 7 s 7 s (rather than single 
7 s measurements), and propagate them directly into uncer- 
tainties of the inferred cosmological model in Section |8.2| 

Operationally, three separate steps are involved in cal- 
culating the terms of Equation [18] that are prone to system- 
atic effects: PSF modelling (estimating e s ), "deconvolving" 



with the PSF| j (properly removing the effect of the PSF in 
estimating intrinsic galaxy shape) and converting the mea- 
surement to shear (estimating a). The first step is related 
to properly modelling the physical effects discussed previ- 
ously, while the latter two steps are determined by choices 
of algorithms. We use the terms "spurious shear from PSF 
modelling" and "spurious shear from shear measurement al- 
gorithms" to refer to these two classes of errors. 

Given a perfectly known PSF, an imperfect algorithm 
can still render spurious shear. Several works have studied 
the effectiveness of different PSF deconvolution algorithms 



and quantified the errors in shear measurements (Heymans 
|et al.|[2006l |Massey et aLl[2007l |Bridle et aL"1|2010| |Kit~ 



13 



Deconvolution here implies some algorithm that removes the 



effects of the PSF from the galaxy images, which may or may not 



be a mathematically exact deconvolution. 
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ing et aL||2012a|b I. This spurious shear studied in previous 
work ("spurious shear from shear measurement algorithms" 
in our classification) is not necessarily intrinsic to the mea- 
surement, and is therefore not the main interest of this pa- 
per. We choose to use KSB, one of the most popular weak 
lensing algorithms, as our test method, but design the simu- 
lations and analyses as described below to eliminate some of 
the known flaws in this method. All of our results can thus 
be viewed as the best possible results achievable by a KSB 
pipeline. In principle, more sophisticated pipelines should do 
even better. 



As mentioned in Section |4.2| we choose to use simple 
circular Gaussians as galaxies to perform our analyses. In 
terms of shear measurement, this implies that our estimates 
of the spurious shear from these galaxies will not be heavily 
affected by the choice of a simplistic moment-based method 
like KSB. Furthermore, to account for the "calibration fac- 



tors" often used in KSB-like algorithms ( 


Hey mans et al. 


2006| |Massey et al.||2007| |Bridle et al.||2010 


| |Kitching et al. 


2012a|b|), which are derived from simulations and intended 



to calibrate the process that converts ellipticity to shear, we 
use a calibration factor that shifts the mean shear in each 
frame to zero, which effectively performs a perfect calibra- 
tion for the additive shear error. 

On the other hand, spurious shear induced by PSF mod- 
elling is less dependent on the specific shear measurement 
algorithm; instead, it is heavily affected by the nature of the 
various physical effects. In fact, to model the PSF across an 
image, we are really just modelling the response function 
of a point source to all the physical effects across the focal 
plane. For a multi-epoch survey like LSST, the two classes 
of physical effects - non-stochastic and stochastic - should 
be modeled differently. 

For the non-stochastic errors, since they show repeated 
patterns over multiple exposures, there is a massive number 
of stars that contain information to constrain the model. 



Jarvis & Jain (2004) first suggested the concept of detecting 
the repeated patterns in the data themselves via principle 
component analysis (PCA). For current surveys, this is be- 
coming a standard operation for PSF modelling in weak lens- 
ing analyses. The power of PCA scales with the total num- 
ber of stars in all the exposures, which essentially scales with 
l/v^/V, where N is the number of exposures taken with sim- 
ilar observing configurations ( Jain et al.||2006"| ). For LSST, 
we believe that the large number of exposures in the sur- 
vey would enable us to characterise the non-stochastic PSF 
very accurately. PSF variation induced by stochastic effects, 
however, can be captured only from stars in a single expo- 
sure, which are both sparse and noisy. The stochastic PSF 
variation would be modeled poorly in a PCA-like approach. 

Note that the two classes of shear measurement errors 
are not necessarily decoupled, making them difficult to sep- 
arate from one another. In this work, our goal is to quan- 
tify the former, the "spurious shear from PSF modelling". 
We do this by first eliminating the algorithm-dependence in 
our shear measurement algorithm by carefully designing the 
simulations and analysis pipeline, and then by testing for 
any residual shear errors from the algorithms with perfect 
knowledge of the PSF model (Section [7T| . 



7 QUANTIFYING ERRORS ON SHEAR 
MEASUREMENTS 

We have shown in Section [5] that the total ellipticity error 
correlation is at the 10 -4 — 10~ 3 level for a fiducial LSST 
single exposure. In this section, we correct the PSF effects 
in these simulations and measure the spurious shear corre- 
lation. Three different PSF model scenarios are considered: 
The first assumes perfect knowledge of the PSF; the second 
assumes that a PCA-like method is used to model the PSF, 
yielding perfect knowledge of the non-stochastic component 
of the PSF but no information about the stochastic com- 
ponent of the PSF; the third assumes that we attempt to 
model both components of the PSF simultaneously by us- 
ing a standard method - interpolating a smooth polynomial 
function between measurements of individual stars. By per- 
forming these three tests and examining the residual shear 
correlation function, we can pin down the sources of spurious 
shear correlation functions. 

All analyses are measurements of the spurious shear 
from fiducial galaxies in the set of 20 focal-plane-size simula- 
tions described in Section [5. 2. 1 1 that contain all the physical 
effects modelled in PhoSim; we will refer to this set of sim- 
ulations as the "master set". In the three subsections below, 
we describe the simulations used to obtain the three dif- 
ferent PSF models and show the spurious shear correlation 
functions we measure from the master set using the three 
PSF models. 

Also, if not otherwise specified, since the measured 
shear of any simulated image is effectively "spurious shear" 
generated from the PSF modelling and correction process 
(e* = ■y = 0), we omit the superscripts in our notation and 
use 7 (£77) instead of 7 s (£ 7 s 7 s) or i {(.a). 



7.1 Perfect PSF model 

In this test, since the spurious shear from PSF modelling is 
by definition zero, the spurious shear we measure indicates 
any imperfections of the KSB implementation we adopted. 



7.1.1 Simulations and results 

We generate a set of 20 focal-plane-size images identical to 
the master set, except that at the location of each galaxy, we 
simulate a bright star instead. The shape of each bright star 
is measured and the shape parameters are used to construct 
the PSF models for its galaxy partner in the master set. 

In Figure [8] (a), we show the median shear correlation 
function for the 20 simulations. We show that by using a 
perfect PSF model, the spurious shear correlation is noisy 
but consistent with zero. This suggests that our idealised 
KSB implementation corrects the PSF effects nearly per- 
fectly. Also plotted in Figure [8] (a) for comparison are the 
ellipticity correlation function for the galaxies and the ellip- 
ticity correlation function of the PSF model. The PSF spa- 
tial correlation prints through and is apparent in the shear 
correlation as can be seen by the similarities between the 
blue and red curves. The error bars show the standard de- 
viation in the 20 realisations divided by \/20. 
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7.2 Perfect non-stochastic PSF model 

Next we assume that the non-stochastic component of the 
PSF can be characterised perfectly by analysing a large num- 
ber of exposures via, for example, a PCA method. However, 
no attempt to model the stochastic component of the PSF 
has been made. In this case, one would construct PSF mod- 
els that capture only the non-stochastic effects discussed in 
Section r5.ll The measured shear correlation function in this 
test is the spurious shear from not modelling and correcting 
for the the stochastic PSF variations. 



7.2.1 Simulations and results 



Similar to Section |7.1.1| we generate one focal-plane-size im- 
ages with only the non-stochastic effects in the master set 
included and replace each galaxy in the master set with a 
bright star. The shape of each bright star is measured and 
the shape parameters are used to construct the PSF models 
for its galaxy partner in the master set. 

In Figure [8] (b), we show the median shear correlation 
function for the 20 simulations. With corrections only for 
the non-stochastic component of the shape of the PSF, the 
spurious shear correlation for a single 15-second exposure is 
at the few times 10 -4 level. Also plotted for comparison are 
the ellipticity correlation function for the galaxies and the 
ellipticity correlation function of the PSF model. The error 
bars show the standard deviation in the 20 realisations di- 
vided by y/20. Since we have shown in Figures ||]and|6] that 
the level of the stochastic ellipticity error correlation func- 
tion is more than one order of magnitude larger than that 
for the non-stochastic ellipticity errors, it is reasonable that 
there are large spurious shear correlations when we correct 
only for the non-stochastic effects. The main effect of the 
PSF correction in this scenario is to correct for the PSF size 
and the weighting factor - very little PSF ellipticity spatial 
variation is corrected. 



7.3 Model both non-stochastic and stochastic 
PSF via polynomial models 

In Section 17.2.11 we have shown that even when the non- 
stochastic PSF is corrected, there can still be large shear 
residuals in single exposures due to stochastic PSF ef- 
fects. This motivates us to model both stochastic and non- 
stochastic PSF components simultaneously. One common 
approach is to fit certain shape parameters of stars across the 
individual CCD sensors with a low order polynomial func- 
tion, with the underlying assumption that the PSF spatial 
variation is smooth on individual sensor scales. The shear 
correlation function determined from this test is a measure 
of the spurious shear arising from incorrectly modelling and 
correcting for the stochastic and non-stochastic PSF varia- 
tions using polynomial PSF models constructed from stars. 



7.3.1 Simulations and results 

We generate a set of 20 focal plane-size images identical to 
the master set, except that the fiducial galaxies are replaced 
by a realistic star sample obtained from the PhoSim sky 
catalogue, randomly located over the field. On average each 



sensor-size image contains 120 — 150 stars used for PSF mod- 
elling (SNR>13). The shape of each star is measured and 
the shape parameters are interpolated with nth-order poly- 
nomials onto the locations of the galaxies to obtain the PSF 
model at the location of the galaxies in the master set. We 
tested for several n values and show only the best case (n=5) 
here. 

Figure [8] (c) shows the residual shear correlation func- 
tions when a 5th-order polynomial interpolation of stars is 
used to model the PSF. Also plotted are the ellipticity corre- 
lation function for the galaxies and the ellipticity correlation 
function of the 5th-order polynomial PSF model. The error 
bars show the standard deviation of the 20 realisations di- 
vided by \/20. 

Excess power is present on small scales in the shear 
correlation function and the slope has a slight transition at 
~ 3', beyond which the curve decreases less steeply. The 
negative correlation on large scales is an artifact from the 
shear calibration procedure described in Section |6j where 
the measured shear distribution in single measurements is 
calibrated to have zero mean, forcing part of the positive 
correlation to become negative. This excess power on small 
scales is expected, since structures within scales smaller than 
[sensor size]/n cannot be modeled by a polynomial of order 
n, where [sensor size]~ 18' in our simulations, the part of 
the PSF not modeled by the polynomial prints through as 
spurious shear correlation. The fact that the PSF variations 
on small scales have significant power coming from the at- 
mosphere, which we have shown in Section [5. 2| means that 
the spurious shear correlation will also have excess power on 
these small scales. We measure the level of spurious shear 
correlation for a single 15-second exposure using a polyno- 
mial PSF model as 5 x 10 -4 at small scales and decreasing 
by two orders of magnitude towards larger scales. We have 
also examined how the different n values affect the level of 
the correlation function, and found that the general shape 
of the shear correlation remains similar to the n=5 case but 
the transition point where the correlation starts to rise at 
small scales changes according to [sensor size]/n. To improve 
upon this simple polynomial model, one would need to de- 
velop a more flexible interpolation technique that captures 
structures on different scales in a more efficient way. We pro- 



pose such an approach in a companion paper (Chang et al 
[20T2l. 



8 DISCUSSION 

8.1 Combining multiple exposures 

We now estimate the spurious shear correlation function in 
a combined 10-year LSST dataset. In the most simplistic 
case where all N exposures on the same galaxy field have 
similar image quality, we show in Appendix[C]that averaging 
the shear measurements in the N exposures suppresses the 
stochastic piece of the spurious shear correlation by a factor 
N. But in a realistic case of varying image quality, the N 
scaling is no longer straightforward. One needs to estimate 
the "effective number of exposures", or N c g, taken on each 
galaxy, which essentially weights each exposure according to 
the image quality. We direct the reader to Appendix [D] for 
how we estimated the value of N e e that is suitable for our 
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Figure 8. The absolute ellipticity correlation function for the 
fiducial galaxies before PSF correction (dotted black) and the 
absolute shear correlation function after PSF correction (solid 
green) for (a) perfect PSF models, (b) perfect non-stochastic PSF 
models, and (c) PSF models constructed with a 5th-order polyno- 
mial fit to bright stars. The absolute ellipticity correlation func- 
tion (dashed red) in each case is plotted for comparison. All curves 
are the medians of 20 different realisations under the fiducial ob- 
serving condition, with the error bars indicating the standard 
deviation in the 20 curves divided by \/20- Negative values are 
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Figure 9. Absolute spurious shear correlation function after com- 
bining 10 years of r- and i-band LSST data when a standard 
KSB algorithm is implemented and the PSF is modeled at dif- 
ferent levels: non-stochastic PSF knowledge only (dashed) and 
partial stochastic PSF knowledge from polynomial interpolation 
of stars (solid). Red lines indicate the pessimistic case assuming 
N e f[ = 184 while the green lines show the optimistic case when 
iV e ff = 368 is assumed. All curves are the medians of 20 different 
realisations under the fiducial observing condition, with the error 
bars indicating the standard deviation in the 20 curves divided 
by \/20. Negative values are plotted with open symbols. 



analysis. From Appendix [Dj we estimate N e g to be between 
184 and 368, with N e g = 184 being the most pessimistic 
scenario and N c g — 368 being the most optimistic. 

Consider now the three scenarios described in Section[7] 
where KSB is used to correct for the PSF effects and the 
three levels of PSF modelling are assumed. For a hypothet- 
ical perfect PSF modelling technique, the shear errors in 
individual frames are already consistent with zero, so there 
is no need to discuss the combined results here. 

For the second case, we know only the non-stochastic 
component of the PSF. In this case, spurious shear correla- 
tions result from not modelling any of the stochastic compo- 
nent of the PSF shape. In the combined dataset, the latter 
contribution can be estimated by taking the solid spurious 
shear correlation function in Figure [8] (b) and multiply by 
1 /N a f! to account for the averaging of the stochastic spurious 
shear correlation. 

When both the non-stochastic and stochastic PSF com- 
ponents are modeled using a 5th-order polynomial model fit- 
ted to the stars, we assume that the smoothly varying non- 
stochastic PSF component is fully modeled and the spurious 
shear is mainly due to stochastic PSF modelling errors. The 
combined shear correlation function then can be estimated 
by scaling the spurious shear correlation function in Figure[8] 

(c) by 1/iVaff. 

The total expected spurious shear correlation functions 
from combining N e g exposures for the latter two cases are 
shown in Figure [9] 
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8.2 Implication for constraints on cosmological 
parameters 

We now interpret the spurious shear correlation function 
derived in Section [8.1| in terms of the implied uncertainties 
on inferred cosmological parameters. 

Since ~y = in all our analyses, we can identify the £ 77 
measured in Section[7]to be the "additive spurious shear cor- 
relation function" £ add introduced in Huterer et al. (20061. 
According to AR08, for several hypothetical forms of the 
spurious shear power spectrum, one can calculate the upper 
limits for allowed systematic errors of predictions of the ma- 
jor cosmological parameters via a simple extension to the 
Fisher Matrix formalism. This upper limits on the system- 
atic errors are set so that the systematic errors do not exceed 
the statistical errors. In a survey with statistical power sim- 
ilar to the LSST survey, AR08 suggests the following limits 
on the spurious shear power spectrum: 



1 

2^ 



(19) 



where C add is the power spectrum corresponding to £ add , 
which can be derived through Equation [5] 

Equation [19] is in the form of shear power spectra, but 
our measurements are in the form of shear correlation func- 
tions. To properly connect our results to Equation |19| we 
revisit the hypothetical power spectrum used in AR08: 

CLM = (" lo Sio(^) + 1) , (20) 

where n is the slope of the log-linear power spectrum, £o is 
an arbitrary reference point chosen to be 700 in the paper 
and Ao is the normalisation. 

Since the analytical form of Equation |20] is straightfor- 
ward to integrate, we can use Equation [5] to find the cor- 
relation functions that correspond to power spectra in the 
form of Equation [20] for a range of n and Ao values. These 
correlation functions then can be compared to the spurious 
shear correlation functions in Figure [9] to determine the best 
matched n and Ao values. We calculate for this particular 
set of n and Ao, the al ys values and compare with the tar- 
get set via Equation |19| This process gives us an estimate 
of the level of uncertainties in the cosmological parameters 
when these forms of systematic errors in the shear correla- 
tion function are present. A more accurate estimate of a^ ys 
can be obtained by the full Fisher Matrix calculation using 
these measured shear correlation functions. 

We explore the parameter space — 3 ^ n ^ 1 and 
10~ 9 ^ Ao ^ 10~ 5 , which is chosen to be consistent with 
the ranges tested in AR08. In this range, we find that the 
family of functions is not always a good description for the 
spurious shear correlation function we measure from simu- 
lations. In particular, the sharp rising curve and the oscilla- 
tions at small scales when polynomial PSF models are used 
cannot be properly modeled by the correlation function cor- 
responding to the log-linear power spectra. As a result, we 
match scales only larger than ~ 3', knowing that in reality 
these smaller scales ( < 3') may not enter in constraining 
cosmology. The resulting n and Ao values as well as the 
corresponding o1 ya values are listed in Table [4] Figure 
shows, for the optimistic case, the two spurious shear cor- 
relation functions overlaid by their functional-form counter 
parts. Note that these grey curves are not fits - they are 
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Table 4. For the 10-year combined r- and i-band data of LSST, 
the best matched n's and Ad's to the spurious shear correlation 
function under different scenarios are listed. The numbers are 
measured with a KSB pipeline and under two different PSF model 
assumptions. Two scenarios for effective number of exposures for 
each field are assumed: the optimistic case corresponds to N e ff = 
368 and the pessimistic case corresponds to N e f{ = 184. 
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Figure 10. The two spurious shear correlation functions in the 
optimistic scenario (solid green) overlaid with functional forms 
assumed by AR08 that visually match the level and approximate 
shapes of the spurious shear correlation (dashed grey). The green 
curves are the medians of 20 different realisations under the fidu- 
cial observing condition, with the error bars indicating the stan- 
dard deviation in the 20 curves divided by \/20. Negative values 
are plotted with open symbols. 



matched visually because the shapes of the measured shear 
correlation functions are quite different from the assumed 
functional forms. 

In Table [4] we show that in the canonical weak lens- 
ing pipeline (KSB + polynomial PSF model), the spurious 
shear power spectrum we have measured from simulations 
is approximately 0.9 - 1.8 times the statistical errors. Al- 
though the numbers imply that by using the current weak 
lensing pipeline, we are already reaching the level of system- 
atics in shear measurements required for LSST, it should 
be understood thatEnot all potential effects (such as shear 
calibratio n, g alaxy modeling, photo-z estimation, chromatic 
PSF effedPjefc.) have yet been included. To ensure that the 
cosmic shear measurements from LSST is not systematics- 
limited after considering all the other systematic errors, we 



14 The PSF shapes measured from the stars is different from the 
PSFs of the galaxies due to the differences between the SEDs of 
stars and galaxies. 
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will need shear measurement methods more sophisticated 
than what we used in this study. 

We can trace the source of these systematic errors to 
improper modelling of the stochastic PSF using polynomial 
functions, which needs to be reduced when developing the 
next generation of shear measurement algorithms. On the 
other hand, if only the non-stochastic errors are modeled, 
the spurious shear due to not modelling the stochastic PSF 
results in a al ys value one order of magnitude greater than 
the target value. This implies that the stochastic PSF com- 
ponents do not average out enough by themselves if not 
corrected, even when the full dataset is combined. We thus 
show the importance of modelling the stochastic as well as 
the non-stochastic components of the PSF. 

We have shown here that given a typical weak lensing 
pipeline, the major physical effects in an LSST observation 
will not seriously limit LSST, provided that the number of 
exposures in the combined dataset and the image quality 
are as expected. 

8.3 Effect of simplifications 

At this point we summarize the major assumptions that 
underlie our analysis to provide context for our results: 

First, we have deliberately designed our simulations and 
the analysis we performed to minimize algorithm-dependent 
contributions to the errors. In particular, we used circular 
Gaussians as our galaxy models, invoked KSB as our shear 
measurement method, and performed an artificial "perfect 
calibration" for the KSB pipeline. Thus the results derived 
in Sections |8.1| and |8.2| only take account of the algorithm- 
independent part of the additive spurious shear correlation 
function. In particular, recent work ( Hirata fc Seljak||2003 



|Refregier et al.| 2012 Mcl chior fc Viola|2012[ ) has shown that 
the algorithm-dependent shear errors are strongly affected 
by noise (the so called "noise bias"), which arises from us- 
ing very low SNR galaxies. In our analyses, this factor is 
suppressed through the use of a simplistic galaxy model. 
However, given the low SNR (~8) of our fiducial galaxy, 
the noise bias for realistic weak lensing galaxies may not be 
negligible. 

Second, we have not taken into account more sophis- 
ticated schemes for combining shear measurements from 
multiple exposures. As suggested by Jain et al. (20061, the 



stochastic component in the shear errors can be eliminated 
by dividing the full dataset into sub-groups of exposures and 
only correlating shear measurements between different sub- 
groups. We provide a brief discussion in Appendix [E] of such 
implementations, but have not investigated the full power 
of these alternative approaches in this paper. 

Third, as mentioned in Section [I] this work is based on 
the projected two-point shear correlation function. In a full 
weak lensing analyses where lensing tomography and higher- 
order statistics are used, additional constraints may arise. 
However, the combination of all these different statistics may 
also be useful in mitigating certain systematic effects. 

Finally, we also note that the applicability of the ex- 
tended Fisher Matrix formulae in AR08 and therefore Equa- 
tion[l9]to our analyses depends on some specific assumptions 
regarding the statistical properties of the spurious shear con- 
tributions we measure in the simulations. The main assump- 
tion in AR08 is that the higher-order statistical properties 



of the spurious shear are similar to those of the true shear 
- a Gaussian random field - so that the covariance matrix 
for the spurious shear power spectrum is close to diagonal. 
(This is implicitly assumed when deriving Equation 10 from 
Equation 9 in AR08.) Since the statistical properties of the 
spurious shear depend on its physical origin, they are not 
guaranteed to be Gaussian. However, in our simulations, 
the effect of such non-Gaussian spurious shear is likely to 
be small compared to the Gaussian component generated 
from counting statistics and the various stochastic effects; 
therefore the results derived in Section 18.21 should be suffi- 
ciently robust. 



9 CONCLUSIONS 

In this paper, we have carried out a bottom-up, quantita- 
tive study of the potential systematic errors in cosmic shear 
measurements for future LSST-like surveys using high fi- 
delity simulations. 

Simulations are generated using PhoSim, a photon-by- 
photon Monte Carlo ray-tracing software that models all 
major physical effects from the top of the atmosphere down 
through the detectors. Specifically, we have generated a suite 
of special simulations in order to isolate the systematic errors 
in ellipticity and shear measurements caused by different 
physical effects, which would have been impossible to achieve 
with a real telescope. 

We identify the most important physical effects in terms 
of their impact on ellipticity measurements and classify them 
into two classes: non-stochastic and stochastic. The elliptic- 
ity errors and their correlation properties caused by each 
individual effect are then quantified in a systematic way. 
We find that, in a single LSST exposure: 

• Ellipticity errors due to counting statistics dominate 
the total ellipticity errors, whereas ellipticity errors due to 
atmospheric and instrumental effects dominate the total el- 
lipticity error correlation function. 

• The ellipticity error correlation function due to non- 
stochastic effects is one order of magnitude smaller than 
that due to stochastic effects. 

For shear measurement, we identify three steps in a 
canonical weak lensing pipeline that lead to spurious shear, 
two of which are dominated by the specific algorithm chosen 
for PSF characterisation and deconvolution, which we have 
not investigated in detail. The third step involves modelling 
the PSF spatial variation with scattered stars. We carry out 
the full analyses with a standard weak lensing algorithm 
and quantify the spurious shear correlation under different 
assumptions about the PSF model. We draw several conclu- 
sions: 

• With perfect PSF knowledge, systematics induced by 
the algorithm in an idealised KSB implementation are neg- 
ligible. 

• Not correcting for the stochastic component of the PSF 
shape introduces large shear systematics in the correlation 
function. 

• A conventional PSF modelling scheme using polynomial 
interpolation of stars can partially model the stochastic PSF 
contribution, but the inflexibility of the functional form of 
polynomials limits the power of this method. 
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The single-exposure results are then extrapolated to 
the full combined 10-year dataset, and finally interpreted in 
terms of the constraints on dark energy parameters accord- 
ing to an extended Fisher Matrix calculation from AR08. 
We draw several conclusions: 

• By using a canonical weak lensing analysis pipeline, the 
systematic errors in the spurious shear correlation function 
induced by the major physical effects, after combining the 10 
years of LSST data, is at a level approaching the statistical 
errors. 

• The errors mainly come from imperfect modelling of 
the stochastic PSF, which has not been studied in detail 
in the past. This calls for better basis functions that can 
characterise structures in the stochastic PSF variation on 
all scales. 

Finally, this analysis is done under several assumptions 
and simplifications, which may need to be taken into account 
when interpreting the results: 

• We have designed the simulations and analysis to avoid 
algorithm-dependence of this analysis. Algorithm errors will 
need to be estimated and combined with the results here to 
yield the total shear systematic errors. 

• A simple scheme is used for combination of shear mea- 
surements in multiple exposures. More intelligent use of the 
data can potentially give better results. 

• Only a projected 2D two-point correlation function is 
analysed. By implementing weak lensing tomography or 
higher-order statistics, some of the spurious shear can be 
mitigated. 

• We adopt the results from AR08 to interpret the spuri- 
ous shear correlation function in terms of its effect on the un- 
certainty in predicting cosmological parameters, which im- 
plicitly assume that the spurious shear has statistical prop- 
erties similar to the true shear. 
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APPENDIX A: 
PHOSIM 



PHYSICAL MODELS IN 



Al Optics and optics perturbations 

PhoSim builds in the most up-to-date optics design of the 
instrument. This includes detailed specifications of the di- 
mensions and wavelength response of each optical element 
from the engineering design (the three mirrors, three lenses 
and the filter), characteristics of the backside-illuminated 
thick CCD detectors, and other telescope components such 
as the shutter and spider, scattered light and tracking mech- 
anisms. The PhoSim version used in this paper is based 
upon the optics baseline design version 3.3. 

In addition to the design, PhoSim also models the level 
of residual wavefront errors after a typical correction from 
the AOS has been made. The effects of these residuals are 
modeled using hundreds of parameters that displace or de- 
form the different optical components, causing the PSF to 
degrade from the design within levels allowed by the engi- 
neering requirements on the AOS. This approach is different 
from an exact simulation of the AOS, which would have to 
take into account the history of the wavefront measurements 
over a period of hours as the survey proceeds. The latter 
approach would require a vast increase in the number of ex- 
posure simulations that are performed, and is thus compu- 
tationally impractical. Therefore, the current PhoSim takes 
the alternative approach of modelling residuals of the full 
control loop instead. Since exposures on the same patch 
of sky are usually separated by a few days, which is much 
longer than a typical time scale for the AOS updates, the 
assumptions involved in this procedure are well justified. 

We further classify the residual wavefront errors, or op- 
tics errors in PhoSim, into two classes. The first class ac- 
counts for errors that originate from fabrication and inte- 
gration, or are introduced by gravity and the thermal envi- 
ronment of the telescope. The former are permanent func- 
tions of the response, while the latter are by nature highly 
repeatable, and the zeroth-order corrections will be imple- 
mented by the AOS according to a pre-calibrated look-up 
table. However, the AOS cannot perfectly compensate for all 
distortions given the limited number of degrees of freedom 
that are actively controlled; therefore, there will be repeat- 
able optics errors, which are "non-stochastic". The charac- 
teristic amplitudes of these perturbations used in PhoSim 
are derived from finite element modelling of the telescope 
and camera under appropriate gravity and thermal loads. 
The second class includes the residual wavefront errors as 



well as actuator errors and wind shake. Since these effects 
are random in nature and do not show repeated patterns 
across exposures, they can be measured only by monitoring 
the wavefront errors in real time. The imperfect correction 
by the AOS for these effects introduces "stochastic" optics 
errors which are uncorrelated from exposure to exposure. 
For the purpose of this paper, we deliberately implement 
the optics model in PhoSim in a way that allows the two 
classes of optics errors to be separated - this is because we 
are interested in how they separately enter into the shear 
measurements, as discussed in more detail in Section [4] 

The typical levels of the two classes of optics errors on 
the major optical elements used in this analysis are listed 
in Table |A1| Note that the numbers corresponding to this 
classification scheme are based on the current engineering 
specifications (Ivezic et al. 2011 and internal documents), 



but the results are easily scalable if different specifications 
are eventually adopted.. 

A2 Tracking errors 

Tracking errors in PhoSim are modeled by a Gaussian ran- 
dom walk of the telescope pointing in each of the three di- 
rections: azimuth, elevation and rotation, where a step is 
taken every 0.1 second throughout the exposure. The effect 
of the tracking error integrated over 15 seconds is to yield a 
root- mean-squared (RMS) error of ~ 0.02" in the azimuth 
and elevation directions, and ~ 1" in rotation. 



A3 The atmospheric model 

Due to the short exposure time of LSST, atmospheric ef- 
fects on image distortions become more pronounced in a 
single exposure than what is usually seen in longer expo- 
sures. Since these atmosphere-induced distortions arise from 
turbulent structures in the air density, we can also expect 
spatial structures in the image shape distortions across the 
field which are associated with the turbulent structures. IDel 



Vries et al. ( 2007 1 first showed via simulations that the effect 



of the atmospheric turbulence on PSF shape distortion aver- 



ages out rapidly with exposure time, while Wittman ( 2005 \ 
andlHeymans et al. (20121 measured the correlation of PSF 



shape distortions in short exposure data and discuss their 
potential effect on weak lensing. 

Similar to the pure atmospheric simulations in |De Vries| 
|et al.| ( |2007| , the atmosphere in PhoSim is modeled by 
multiple layers of moving atmospheric screens. The "frozen 
screen approximation" is justified since the time scale for 
the shapes of turbulent cells to change significantly is much 
longer than the time required for turbulence cells to pass 
through the field of view, given the typical wind speeds of a 
few meters per second ( |Taylor|1938} |Poy nccr et al.|2009| >. 

The heart of the atmospheric model is a set of multi- 



scale, multi-layer frozen Kolmogorov screens (Kolmogorov 
|1992[ ). These atmospheric screens are constructed according 
to a full 3D Kolmogorov spectrum with assigned parame- 
ters including the structure function, inner scale, outer scale, 
wind speed and wind direction. All parameters are modeled 
from existing atmospheric data taken near the LSST site. 



In Peterson et al. (2012), we explain the theoretical justifi- 



cation of the approach we have taken, as well as the major 
innovations in the PhoSim atmospheric model. 
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Table Al. Specifications of the major optics errors modeled in PhoSim. We describe the surface height variations on the mirrors using 
2nd- through 5th- order Zernike polynomials, where each polynomial is normalised individually with some amplitude according to typical 
values observed in existing systems. We list here only the range for the four amplitudes used. 



Source of error 



Stochastic or not 



Random sampling 



Physical parameter 



RMS values 



Solid-body optics errors 
in the mirrors 



non-stochastic 
stochastic 



Gaussian 
Gaussian 



misalignment 
tip/tilt 

misalignment 
tip/tilt 



1.08 X 10 -2 (mm) 
2.34 X 1CT 6 (rad) 
1.89 X 1CT 2 (mm) 

4.09 X 10" 6 (rad) 



Surface height variation 
in the mirrors 



non-stochastic 
stochastic 



Gaussian 



coefficients of Zernike 
polynomial expansion 



(0.38 - 2.20) xlO -4 (mm) 
(0.76 - 4.40) xlO -4 (mm) 



Solid-body optics errors 
in the camera 



non-stochastic 
stochastic 



Gaussian 
Gaussian 



misalignment 
tip/tilt 

misalignment 
tip/tilt 



3.24 X 10 -2 (mm) 
1.58 X 10" 4 (rad) 
5.67 x 10~ 2 (mm) 
3.77 x 10" 4 (rad) 



Sensor surface displacement 
from ideal focal plane 



non-stochastic 



Zernike expansion based height 
on laboratory measurements variation 



5x10 3 (mm) 



APPENDIX B: KSB FORMULAS 

The following formulas are the foundation for performing a 
weak lensing measurement using the KSB algorithm. We use 
the Einstein summation convention. 

First, stars and galaxies are measured with the "get- 
shape" imcat routine, which assigns to each object the 
following shape parameters: the complex ellipticity (s a ), 
the smear polarisability (P^p) and the shear polarisability 
{Paf)i where a, P = 1, 2. e a is defined in Equations [l] and JiJ 
while P^™ and P*p are calculated through 

a0 — *-ap — £a£/3 , (til) 

rysh -ysh sh {TiO\ 

■r a — A aP — £a£{j ■ ( tiZ ) 

X£p, X£p, e 3 ™ and ef" are derived through combinations 
of weighted second moments of the light profile of the object 
f(xi,X2) and derivatives of the weighting function W{xi,X2) 
(see Kaiser et al.||1995 for definitions of these quantities). 



Each component e a , P^™ and P^p for the stars is inter- 
polated to the galaxys' locations and recalculated with the 
galaxys' weighting functions. The remeasured quantities will 
be used to construct the PSF model for the galaxy. 

The anisotropic PSF effects on the galaxy's ellipticity 
are first corrected through 



where 



p a = (P 



* ,sm \ — 1 * 

] a p£p 



(B3) 



(B4) 



The superscript "*" indicates parameters of the PSF model. 
Shear g a changes the galaxy's ellipticity by 

Se a = P^gp ■ (B5) 

Finally, we need to correct for the weighting and circular 
seeing effects to get the final shear estimate for each galaxy 
by replacing P^ with the shear susceptibility P^p, where 



p7 _ psh 



rySm f r>* ,s m \ — 1 ry*,sh 



(B6) 



P^p is eventually replaced b y \tr\P 1 ] in o ur approach, sim- 
ilar to the "ES2" method inlMassey et al.l (120071. 



APPENDIX C: 1/N SCALING FOR THE 
STOCHASTIC SPURIOUS SHEAR 
CORRELATION 

To demonstrate the 1/iV scaling for the stochastic spurious 
shear correlation, we use the 20 simulated shear catalogues 
used in Section [7. 3| where we have argued that most of the 
shear errors in these catalogues are stochastic. We then con- 
sider averaging shear measurements of the same galaxy in 
the first N different frames, and calculate the shear correla- 
tion function for the averaged shear catalogue as a function 
of N. 

The results for N = 1, 5, 20 are shown in Figure [Clj We 
observe that, although the low statistics causes the data to 
be quite noisy, the spurious shear correlation function does 
roughly follow the 1 /N scaling. This supports our argument 
in Section |8.1| where we extrapolate our single-exposure 
measurements to the full 10-year LSST dataset. 



APPENDIX D: 
EXPOSURES 



EFFECTIVE NUMBER OF 



In the 10-year period of observation planned for LSST, ev- 
ery patch of sky is imaged approximately 386 times, each 
with 15-second exposures (the number doubles to 772 if r 
and i bands are combined); however, because of variation 
in the observing conditions and the galaxy properties them- 
selves, not all galaxies have good shape measurements in all 
exposures. The power of combining a multi-epoch dataset is 
that one has the freedom to weight the contributions of a 
single galaxy shape measurement in each of the different ex- 
posures according to the image quality, and thus extract the 
maximum information from the noisy dataset. This effec- 
tively means that the "stochastic" spurious shear correlation 
will not cancel as fast as 1/772 under this framework, be- 
cause some of the exposures will be down-weighted due to 
their poor image quality. Instead, the spurious shear corre- 
lation will decrease only as fast as 1/N e g, where N b h is the 
"effective number of exposures" for the entire dataset. 7V c fl 
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Figure CI. The spurious shear correlation function for the mean 
shear catalogue for TV exposures, where N = 1 (solid black), 
5 (dotted green) and 20 (dashed red). The shear catalogue is 
obtained via standard KSB algorithm and 5th-order polynomial 
PSF models. Negative values are plotted with open symbols. 



depends on the detailed distribution of the observing con- 
ditions, the galaxy distributions (brightness, size, redshift), 
as well as the analysis pipeline used. The precise determi- 
nation of N e fs is complicated and beyond the scope of this 
paper; however, we can provide optimistic and pessimistic 
estimates for N e g to bound the final results in a reasonable 
range. 

In this work, we have chosen the fiducial observing con- 
ditions (Table [T| to correspond to roughly the median con- 
dition of the best 50% of the dataset in terms of image qual- 
ity. This gives approximately 184 exposures in the r band 
as well as the i band, or ~368 exposures for the entire weak 
lensing dataset for each field. We view this as an optimistic 
estimation, since it has been shown ( Hirata fe Seljak||2003 



|Refregier et"aTpoT2{ |Melchior fc Viola||2012[ ) that for PSF 
sizes approaching the size of average galaxies (for part of 
the 368 exposures), the systematics can grow nonlinearly 
and the median spurious shear of these 368 exposures can 
be larger than that shown in Figure[8] Furthermore, as men- 
tioned in Section [4.1 1 errors in the 2 exposures in the same 
visit may be correlated, effectively lowering iV ff for the full 
dataset. A more conservative estimation is to use only the 
best 25% of the exposures to make cosmic shear measure- 
ments. In this case we have a total of 184 exposures and the 
results are likely to be pessimistic, since the median spurious 
shear correlation of these 184 exposures is likely to be better 
than those measured in Figure [8] We can assume that the 
optimal outcome of combining the full 10-year data lies in 
between these two bounds N e g = 184 and iV c ff = 368. 



across exposures, only the non-stochastic systematics re- 
main, which do not scale down further with number of expo- 
sures. This, however, comes with the price of decreasing the 
statistical power of a dataset, since there will be a smaller 
number of pairs that contribute to the correlation function. 
The statistical errors are increased by \J N c tf / (N c ft — 1). 

For LSST, the full 10-year dataset includes N eB =184 
- 368, which means that implementing cross correlation will 
not degrade the statistical power significantly. However, at 
earlier stages of the survey, when N c fj is still small, the 
V NeS I (N e fE — 1) penalty may overcome the benefit of im- 
plementing cross correlation. For the purpose of this paper, 
we will not discuss the details of optimising the combined 
multi-epoch dataset; rather, we show one simple example 
for implementation of cross correlation and merely demon- 
strate an alternative way of combining multiple exposures 
to improve upon our results. 

For the 20 simulations in the master set in Section[7j we 
take the spurious shear obtained from the 5th-order polyno- 
mial PSF models and calculate 10 correlation functions by 
correlating shear measurements for galaxies in two different 
exposures (the 20 exposures were split into 10 groups of im- 
age pairs, for which the shear measurement in each image 
is correlated only with shear measurements in the other im- 
age). The median of the 10 correlation functions is plotted in 
Figure |E"Tj together with the shear correlation functions in 
Figure [8] scaled to N e g = 2, so that all curves represent the 
spurious shear correlation function for combining a dataset 
of two independent exposures. Note that in Figure |ET) the 
shear correlation function from correlating galaxies in dif- 
ferent exposures is essentially consistent with zero at angu- 
lar scales larger than ~ 3', but rises steeply smaller angular 
scales. This may be because the PSF models are consistently 
ill behaved on the edges of the CCD sensors from the poly- 
nomial PSF model, making some of the PSF model errors 
"non-stochastic" between frames. This is likely to be an un- 
realistic artifact since no rotation/dithering is used in the 
20 images. 

This suggests that correlating galaxies across different 
exposures can suppress some of the stochastic systematics. 
But similar to the auto-correlation technique, the calcula- 
tion depends on a good PSF modelling process that does 
not create artificial non-stochastic errors. Given the loss of 
statistical power by correlating across exposures, a more de- 
tailed investigation of these tradeoff's will be needed to es- 
tablish the optimal use of LSST's multi-epoch dataset. 



APPENDIX E: CORRELATING SHEAR 
MEASUREMENTS ACROSS EXPOSURES 

In addition to the simple averaging scheme discussed in Sec- 
tion 



8.1 



previous papers (see e.g. Jain et al. 2006) also 
have suggested correlating galaxies in different exposures to 
eliminate the stochastic systematics in the atmosphere and 
instrument. In theory, by correlating shear measurements 



22 C. Chang et al. 




O(arcmin) 

Figure El. For two typical LSST exposures combined, the spu- 
rious shear correlation function when a standard KSB algorithm 
is implemented with 5th-order polynomial PSF models and only 
galaxies from different exposures are correlated (black). Com- 
pared with the simple average scheme (dashed green), correlat- 
ing galaxies from different exposures suppresses mainly spurious 
shear correlation on large scales. The error bars for the black 
curve indicate the standard deviation in the 10 curves divided by 
\/l0. Negative values are plotted with open symbols. 



